Picture for Igor Gitman

Igor Gitman

Llama-Nemotron: Efficient Reasoning Models

Add code
May 02, 2025
Viaarxiv icon

NeMo-Inspector: A Visualization Tool for LLM Generation Analysis

Add code
May 01, 2025
Viaarxiv icon

AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset

Add code
Apr 23, 2025
Viaarxiv icon

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Viaarxiv icon

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

Add code
Oct 02, 2024
Viaarxiv icon

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Figure 1 for Nemotron-4 340B Technical Report
Figure 2 for Nemotron-4 340B Technical Report
Figure 3 for Nemotron-4 340B Technical Report
Figure 4 for Nemotron-4 340B Technical Report
Viaarxiv icon

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Add code
Feb 15, 2024
Figure 1 for OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Figure 2 for OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Figure 3 for OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Figure 4 for OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Viaarxiv icon

Confidence-based Ensembles of End-to-End Speech Recognition Models

Add code
Jun 27, 2023
Viaarxiv icon

Powerful and Extensible WFST Framework for RNN-Transducer Losses

Add code
Mar 18, 2023
Viaarxiv icon

Understanding the Role of Momentum in Stochastic Gradient Methods

Add code
Oct 30, 2019
Figure 1 for Understanding the Role of Momentum in Stochastic Gradient Methods
Figure 2 for Understanding the Role of Momentum in Stochastic Gradient Methods
Figure 3 for Understanding the Role of Momentum in Stochastic Gradient Methods
Figure 4 for Understanding the Role of Momentum in Stochastic Gradient Methods
Viaarxiv icon